CPU/CUDA: fix GQA mul mat back, add CUDA support #11380
+157 −62